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The present invention relates to a method for modifying the genome of cukaiyotic cells by homologous recombination using DNA 
sequences which substantially differ from the target locus with respect to the nucleotide sequence (0,1 to 30 % divergence) in the region 
where recombination takes place. Homologous recombination between diverged sequences is achieved by genetic or transitory inactivation 
of the cell's mismatch repair system. 
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HOMOLOGOUS RECOMBINATION IN MISMATCH I^EPAIR INACTIVATED EUKARYOTIC CELLS 



BACKGROUND OF THE INVENTION 
Introduction 

The introduction of specific modifications in the prokaryotic and eukaryotic genome is a 
powerful tool In studying gene function both at the level of individual cells and in the context 
of a complete organism. In addition, modification of specific genes may result in the 
generation of industrially and medically important organisms, whereas correction of defective 
alleles in eukaryotic cells may provide a substantial step forward in the development of 
somatic gene-therapy protocols. 

The method of genetic modification relies on the ability of virtually every celt type to 
exchange DNA sequences with a high degree of nucleotide sequence similarity by a 
process which is called homologous recombination (1). Briefly, the method of genetic 
modification involves the generation of a so-called targeting construct, a DNA sequence that 
is largely identical to the specific chromosomal locus to be modified, but differing from this 
locus by specific modifications. These modifications can be as small as the deletion, 
insertion or substitution of a single base-pair, or be as large as the deletion or insertion of 
ten's of kilobasepairs. On entry of the targeting construct into the cell, exchange of 
sequences flanking the modification with their chromosomal counterparts, will result in the 
introduction of the modification into the recipient chromosome. 

The efficiency of homologous recombination in both prokaryotes and eukaryotes strongly 
depends on complete sequence identity of exchanging DNA strands (2-5). Thus, sequence 
dissimilarities as small as 0.5% can already strongly impede homologous recombination. In 
several bacterial species, Escherichia coli, Saimonelia typhiniurium and Streptococcus 
pneumor)iae as well as in the yeast Saccharomyces cerevisiae it has been unequivocally 
shown that the DNA mismatch repair system is responsible for suppressing recombination 
between homologous but nonidentical DNA sequences (6-8). 

We have now demonstrated that in mammalian cells deficient for the DNA mismatch repair ' 
gene Msti2, homologous recombination has lost the requirernent for complete sequence 
identity between xchanging DNA sequences. This finding provides a new method for 
modifying the eukaryotic genome using DNA targeting constructs which substantially differ 
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from the target locus in the region where recombination takes place by genetira^^ 
functionally inactivating the cell's mismatch correction system. 

. The method obviates the requirement for DNA targeting constructs that are largely identical . 
to the target locus, thus allowing the efficient genetic modification of both somatic cells and / 
the germ line of outbred organisms. This may provide a route to somatic gene therapy and . 
the modification of eukaryotic species of which inbred strains are not easily available. Also, 
genomic sequences can be replaced by small oligonucleotides carrying one or more, 
basepair alterations or by large chromosomal sequences derived from other species. The 
method can also be used to generate large deletions by intra- or extrachromosomal 
homologous recombination between repeated but diverged sequences. . . ^ 

Genetic modification of mice 

The introduction of specific genetic alterations into the germ line of mice was made possible 
by a combination of new techniques in molecular biology and embryology that have 
become available during the last ten years (9-1 1 ). The method entails the introduction of a 
planned genetic modification in embryonic stem (ES) cells by homologous recombination. 
ES cells are originally derived from the inner cell mass of 3.5 day old pre-implantation 
embryos and can be maintained by /n v/Yro culture as immortalized cell lines, retaining the 
undifferentiated state under appropriate culture conditions. At present, a number of ES cell 
■wia^^^— ^s^^^iirvesf areyavailabii^ ^m^ h^- 
embryos can efficiently compete with the inner-cell-mass cells of the recipient blastocyst in 
embryonic development thus generating a chimeric mouse consisting of cells derived from 
the recipient blastocyst and the injected ES cells. Should the in vitro modified ES cells 
contribute to the germ line of chimeric animals, the ES cell genome can be transmitted to the 
next generation giving animals carrying the introduced modification in all their cells.. 
Intercrossing of such animals reveals the phenbtypic consequences of homozygosity of the 
modified gene. 

The most commonly introduced genetic modification in ES cells is gene disruption leading to 
gene inactivation. This allows to study gene function by analyzing the consequences of the 
absence of a particular gene in the context of a complete orgariism. Also, many hereditary 
human diseases result from hemizygosity of specific genes [e.g. cancer predisposition 
syndromes like hereditary nonpolyposis colorectal cancer (12), Li Fraumeni syndrome (13). 
retinoblastoma (14)]. The generation of mouse models for such diseases by disruption of 
the mouse homologs of the genes involved provides an invaluable tool for studying 
hereditary diseases in an experimental setting. 
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Gene disruption in ES c lis 

Thelnactivation of a specific gene in ES cells via homologous recombination starts with the 
preparation of a DNA targeting construct (15). Two types of targeting constructis can be 
used. In a replacement-type vector, a drug-resistance marker gene (conferring to the cell 
resistance to dmgs like G418, Hygromycln B, Puromycin, Histidihol) disrupts a sequence 
which is homologous to a sequence in or around the target gene in the recipient genome; in 
an insertion-type vector the marker gene flanks the homologous sequence. The targeting 
construct is Introduced into ES cells by electroporation (or alternatively by Ca-Phosphate 
precipitation, lyposome-mediated DNA transfer or micro-injection) and cells are selected for 
stable integration of the targeting construct into the recipient genome by growth in medium 
containing the appropiate drug. Drug-resistant colonies are the result of either one of two 
events: integration of the targeting construct at a random site in the genome or integration of 
the marker gene in the target locus via homologous recombination between the flanking 
sequences and their chromosomal counterparts. The replacement-type vector integrates by 
homologous recombination on both sides of the marker gene; the insertion-type vector 
integrates by a single homologous recombination event leading to duplication of the region 
of homology. Thus, the marker gene serves two purposes: it allows selection of cells that 
have taken up the targeting construct and» on integration via homologous recombination, it 
will disrupt the target gene thereby modifying its function. 

Distinguishing ES cell clones resulting from homologous recombination from those resulting 
from random integration, requires the DNA of individual clones to be analysed by Southern 
hybridisation or the polymerase chain reaction. 

Unfortunately, In many targeting experiments, random integration was often found to be far 
more efficient than homologous recombination and also large variations in targeting efficiency 
were observed for different genes (1 6). In this respect, mammalian cells differ from bacteria 
and lower eukaryotes like yeast (17), Leishmania major (^8) or Trypanosoma brucei{^9), 
where integration of exogenous DNA into the recipient genome exclusively or predominantly 
occurs via homologous recombination. To date, three factors clearly affecting the recovery of 
homologous recombinants in mammalian cells have been identified. First, the frequency of 
homologous recombination increases substantially with the total length of the homologous 
sequences up to 14 kilobase-pairs (20). Second, the expression level of the marker gene at 
the target locus affects the frequency of recovery of homologous recombinants: low 
expression may lead to loss of homologous recombinants, whereas high expression may 
allow selection of homologous recombinants at an elevated drug concentration (21). Third, 
sequence dissimilarities between the targeting construct and the chromosomal target locus 
strongly suppress the efficiency of homologous recombination (5). 
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High efficiency targ ting with isog nic DNA constructs 

The" suppression of homologous recombination in ES cells by small sequence dissimilarities 
became clear by a gene targeting experiment aimed at disrupting the Retinoblastoma gene 
with a neomycin resistance marker gene. Two targeting constructs were prepared carrying 
the neomycin resistance marker embedded in 1 0.5 kb of Rb sequence. In one construct the 
Rb sequence, was derived from mouse strain 129/ and was therefore identical to the 
corresponding chromosomal locus in the ES cells, which v^ere also derived from mouse 
strain 129. In the other construct, the Rb sequence was derived from mouse strain BALB/c. 
The two constructs, designated 129Rb-neo and B/cRb-neo are described in the 
accompanying publication Proc. NatL Acad. Sci. USA, Vol. 89, pp. 5128-5132 on pages 
5128 and 5129, Fig. 1 .The tvvo constructs contained corresponding flb sequences and ;% ' 
were therefore largely similar. However, they differed approximately 0.6% at the nucleotide ^ 
level (which corresponds to the level of sequence polymorphism found in the hurnan 
population). Thus, in a stretch of 1687 basepairs that was sequenced, the BALB/c 
sequence differed from the 129 sequence. by 9 basepair substitutions, three small deletions , 
(of 1 .4 and 6 nucleotides) and two polymorphic CA-repeats (see Proc, NatL Acad. ScL 
USA, Vol. 89, pp. 5128-5132, page 5130, Fig. 4). On introduction of these constructs in 
129-derived ES cells, homologous recombination at with the 129-:derived construct was 
50-fold more efficient than with the nonisogenic BALB/c-derived construct. To provide ^ . . 
additional evidence that the suppressiph of recombination was solely dependent on the 
ft^^f5SiyrrtafrFiRiSff^ 

experiment was performed. I.e. targeting of a BALB/c-derived ES cell line with the 1 29- and 
BALB/c-derived constructs. This experiment yielded the inverse result, I.e. a higher 
targeting efficiency with the BALB/c-derived construct than with the nonisogenic 129- . 
derived construct 

With a somewhat different targeting construct, consisting of a hygronriycin resistance gene 
embedded In 17 kb of Isogenic f?5 DMA, we observed that 80% of all Hygromycin B^ 
resistant colonies resulted from homologous recombination (5). This demonstrates that, in . . 
the presence of perfect homology, also in mammalian cells honiologous recombination rather 
than random integration can be the predominant event. 

Although clearly not all problems of gene targeting have been solved, many genes have - 
now been successfully targeted by the use of isogenic DNA targeting constructs. However, 
genetic modification of cells den/ed from an outbred organism can become a difficult 
endeavour as isogenic targeting constructs are not easily available. In this case, efficient 
gene targeting would require the targeting construct to be prepared from DNA derived from 
the target cell. Especially in the context of gene therapy, this would raise a tremendous 
practical obstacle to correction of a defective gene. Also, base sequence divergence 
imposes a major barrier to exchanging a large chromosomal region of one species by the 
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syntenic region of another species. The present invention provides a way to overcome 
these problems. 

The introduction of subtle mutations 

Although protocols for disruption of genes in inbred ES cell lines (and in somatic cell lines of which 
isogenic DNA targeting constructs can be prepared) are rather well developed, the introduction of 
more subtle mutations is not straightforward. Current protocols are variations on a two-step 
procedure in which first a marker gene is introduced into the target gene followed by replacement of 
the marker gene by the desired subtle mutation (5). This procedure requires the marker gene to t>e 
selectable both for its presence (first step) and its absence (replacement by the subtle mutation). 

Useful marker genes are the Hprt rninigene to be used in Hp/t-deficient ES cells (positive selection 
in HAT medium; negative selection by 6-thioguanine) and a combination of the neomycin resistance 
gene (positive selection by G418) and the Herpes Simplex Virus thymidine kinase gene (negative 
selection by Gancyclovir). In an alternative procedure, the subtle mutation and the marker gene are 
present on the same targeting construct and concomittaritly introduced into the genome by 
homologous recombination. If an insertion-type vector was used, the marker gene can be removed 
by intrachromosomal recombination between the duplicated sequences that were generated during ' ■ 
the first integration event (23). In case of a replacement-type vector, the marker gene can be 
removed if it was flanked by two site-specific-recombination sites (e.g. loxP sites). Recombination 
between these sites on introduction into the ceil of the /oxP-specific recombinase Cre will remove 
the marker gene from the genome (24). 

Although either of the above mentioned procedures has allowed the subtle modification of a number 
of genes in ES cells,,they are highly demanding as to the generation of appropiate DNA targeting 
constructs and the cuituring of ES cells under various selective conditions. 

Therefore, an attractive alternative to these procedures might be the use of small single- or double- 
stranded oligonucleotides (up to 100 bases or basepairs), which are identical to the target locus 
except for one or several basepair alterations. However, our finding that base sequence 
dissimilarities as small as 0.6% strongly suppress homologous recombination, may impose a major 
impediment to using such oligonucleotides for the introduction of subtle genetic modifications. The 
present invention provides a way to overcome this problem and may allow the subtle modification 
of cell lines and cells derived from living organisms and temporarily cultured in vitro. 
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GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 



DESCRIPTION OF THE INVENTION 

The manuscript included as annex in this application provides a detailed description of 
the methodology used in this application and will be published in the journal Cell on July 
28,1995. 

As described above, base sequence dissimilarities as modest as 0.6% impose a strong barrier to 
efficient homologous recombination in mouse embryonic stem cells (5).. Suppression of homologous 
recombination by small base sequence divergencies was earlier observed in bacteria (k coIt\ S. 
typhimun'um and S. pneumoniae), yeast and mouse fibroblasts (2-4). The role of the DNA mismatch 
correction system in suppressing recombination between homologous but diverged sequences^ 
was most dramatically demonstrated by Radman and coworkers in studying bacterial conjugation 
between the related but diverged species Escherichia coii and Salmonella typhimurium (6). This 
process relies on entry of chromosomal fragmehts of one species into the other and recombination of 
these fragments with the chomosome of the recipient bacteriurTi. The sequence divergence between 

irfterspecies cross is about 2x1 qS fold lower than from an intraspedes cross. However, the 
recovery of exconjugants from the interspecies cross increased 3x1 0^ fold if the recipient bacteria 
canied an inactivating mutation In either the mutS or mutL gene, both of which being essential for 
DNA mismatch connection. 

The central protein in DNA mismatch conection in E CO// is enccKjed by the mu^^^ 
recognizes and binds to base mispairs and small loops of up to four unpaired nucieotides. After 
binding to heteroduplex DNA. the MutS-DNA complex is bound by the mutL gene product which 
leads to excision of a tract of single-stranded DNA of up to several kilobases that contains the 
mispalred nucleotide(s). In this process also the mufty gene product plays a role, whereas the mutH 
gene product ensures removal of newly synthesized strands rather than parental strands. The 
repair process is completed by resynthesis of the excised strand and ligation of the remaining nick. 
Mismatch repair in E. cqli ls responsible for maintaining genome stability in at least two ways: i) by 
recognizing and repairing mis- and unpaired nucleotides that occur, respectively, by 
misincorporation and slippage during DNA replication; ii) by recognizing mismatches occurring in 
heteroduplexes fomried at initial stages of recombination between homologous but not-identical 
sequences. This may either lead to blocking elongation of heteroduplex formation or dissociation of 
the heteroduplex thus aborting the recombination reaction. Consequently. E. co// strains defective 
for either mutS or mutL have a pleiotropic phenotype: an increased mutation rate, including 
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destabilization of simple-sequence repeats and an increased rate of recombination between 
homologous but diverged DNA sequences. The latter phenotype is clearly manifested by the 
efficient recovery of recombinant bacteria resulting from conjugational crosses between the related 
but diverged species Escherichia co// and Salmonella tyqDhimurium y^herem the recipient bacterium 
was deficient \or mutS or mutL (6). Also, the frequency of chromosomal rearrangements by ectopic 
recombination between diverged sequences is substantially elevated in mismatch repair deficient 
bacteria (26). 

In many respects, the biochemistry of mismatch repair systems in eukaryotes resembles 
that of the E. colt mutS,L system. Homologs of both genes have been identified in yeast 
and mammalian cells. Based on mismatch binding in vitro, and on the riiutator and 
recombinator phenotypes of Saccharomyces cerevisiae mutants, the protein encoded by 
the yeast MSH2 gene seems to be the functional homolog of MutS (27-29). A homolog of 
the yeast MSH2 gene was identified in mammalian cells by analysis of a GT-mismatch- 
binding activity, positional cloning and PCR amplification of mouse DNA using degenerate 
primers (30). Similarly, homologs of the E. coli mutL gene were identified in yeast and 
mammalian cells. 

Interestingly, inherited mutations in human mutS and mutL homologs were recently found to 
be related to the cancer predisposition syndrome HNPCC (hereditary nonpolyposis - 
colorectal cancer), which is characterized by development of tumors of the proximal colon at 
eariy age. In these tumors, mismatch repair is lost, as manifested by destabilization of 
simple sequence repeats, the replication error-positive (RER+) phenotype (12). 

In order to investigate the role of the mammalian Msh2 gene in DNA mismatch repair and to 
address the role of mismatch repair in maintaining genome stability, we generated an ES cell 
line carrying a disruption in both copies of the mouse MSH2 gene. This line is disignated 
dMsh2-9. Its construction is described in the accompanying manuscript: 
"Inactivation of the mouse Msh2 gene results in mismatch repair deficiency, 
methyiation tolerance, hyperrecombinatlon, and predisposition to cancer", by 
Niels de Wind, Marleen Dekker, Anton Bems, Miroslav Radman, and Hein te RIele, 
to be published in Cell, the 28th July. 1995 on pages 6, 14, 27 (legend to Rg.1) and 30 
(Fig.1). 

As demonstrated in the manuscript, the phenotypic consequences of Msh2 deficiency in 
mouse embryonic stem cells provid clear evidence for an essential role of MSH2 in 
mammalian DNA mismatch repair. Rrst, Ms/i2-deficient ES cells lack binding activity to a 
double stranded 38-mer oligonucleotide carrying a GT mismatch or an unpaired TG 
dinudeotide [pag s 6. 15. 27 (legend to Rg.2) and 30 (Rg.2)]. Second. Ms/)2-deficient ES 
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ceHs have a mutator phenotype as evidenced by an at least 1 50-fold increase in the 
number of ceHs resistant to B-thioguanine. indicating mutational inactivation of the X-linked 
Hprt gene. Moreover, microsatellite length instability was observed In subclones derived 
■ from the /Ws/)2-deficient ES cell line, but not in subclones derived from wild-type ES cells 
[pages 7, 15. 27 (legend to Rg.3) and 30 (Fig.3)]; Third. Ms/72-deficient ES cells resisted a 
20-fold higher concentration of the methylating agent N-Methyl-N'-Nitro-N-Nitrosoguanidine 
than wild-type ES cells [pages 7/8. 1 6. 27 (legend to Fig.4) and 31 (Rg. 4)]. Fourth, mice 
bred to homozygosity for the disrupted /Ws/?2 allele, originally generated in ES cells, were 
: highly predisposed to tumorigenesis [pages 9, 14/15. and 29 (Table 2)]. 
Thus, dismption of Msh2 leads to inactivation of mammalian DNA mismatch repair. 

By using a gene targeting assay, we demonstrated that it is the rnammalian DNA mismatch 
repair system which is responsible for suppressing homologous recombination between 
sequences differing as little as 0.6% at the nucleotide level. As described above, 
homologous recombination at the locus with an isogenic DNA targeting construct was: 
50-fold more efficient than with a similar, but nontsogenic constmct containing on the 
average 0.6% sequence divergence with respect to the target locus (5); However, in 
Ws/72-deficient ES cells, homologous recombination at flfc with the nonisogenic targeting 
■.^^?^!^:wasas effideritaswtht^ 
.»?lnactivatl51^^Sf^e'moijse A7s/i2 gene results in^^ 
methylatfon tolerance, hyperrecombination, and predisposition to cancer",'by 
Niels de Wind. Marleen Dekker, Anton Bems. Miroslav Radman. and Hein te Riele [pages 
8/9. 16. 28 (Table 1)]. 

This experiment demonstrates that it is the mammalian DNA mismatch repair system which 
is involved in preventing recombination between homologous but diverged DNA 
sequences. 

This finding provides a method for modifying the mammalian genome but also the genome of 
any other eukaryotic organism via homologous recombination using DNA targeting 
sequences that substantially differ from the target locus with respect to the nucleotide 
sequence, by genetic or functional inactivation of the cell's mismatch repair system. 
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GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 
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This manuscript is part of the application and contains a detailed description 
of the methodology as described in the application. This manuscript will be 
published in the journal Cell on July 28, 1995. , 
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Summary 

To investigate the role of the presumed DNA mismatch repair (MMR) gene 
M5ii2 in genome stability and tumorigenesis, we have generated cells and 
mice that are deficient for the gene. Ms A2 -deficient cells have lost mismatch 
binding, and have acquired microsatellite instability, a mutator phenotype 
and tolerance to niethylating agents. Moreover, in these cells, homologous 
recombination has lost dependence on complete identity between interacting 
DNA sequences, suggesting that Msh2 is involved in safeguarding the 
genome from promiscuous recombination. Msh2-dencieat mice display no 
major abnormalities, but a significant fraction develops lymphomas at early 
age. Thus, Msh2 is involved in MMR, controlling several aspects of genome 
stability; loss of MMR-controlIed genome stability predisposes to cancer. 
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Introduction 

Cancer is the endpoint of an evolutionary process in which normal cells acquire full 
malignancy by cumulative genetic or epi-genetic alterations, each conferring proliferative, 
invasive or metastatic potential to the cell. In colorectal cancer, at least six independent 
transforming mutations in oncogenes or tumor suppressor genes are required for a cell to 
develop into a fully malignant tumor (reviewed by Vogelstein and Kinzler, 1993). This 
evolutionary process might be dramatically accelerated by the early loss of systems that 
safeguard genome stability, resulting in an increased rate of mutagenesis and chromosomal 
rearrangements (Hartwell, 1992 and Loeb. 1994). In support of this hypothesis, a number of 
hereditary cancer predisposition syndromes are attributed to defects in DNA repair. Two of 
these are Hereditary Nonpolyposis Colorectal Cancer (HNPCC) and Miiir-Torre syndrome, 
which are characterized by the development of tumors of the proximal colon at early age. 
Also cancers of other parts of the gastrointesdnal tract and of the genitourinary tract are often 
found (Lynch et al., 1993). Muir-Torre patients are, in addition, prone to cancers of the 
sebaceous glands and the skin (Cohen et al., 1991). Individuals affected by one of these' 
syndromes have most likely inherited a mutant allele of either one of four mismatch-repair 
(MMR)-related genes called MSH2 , MLHL PMSl or PMS2 (Leach et al., 1993; Bronner 
et al., 1994; Kolodner et al., 1994; Liu et al., 1994, Nicolaides et al., 1994; Papadopoulos et 
al., 1994; Kolodner et al., 1995; reviewed by Jiricny, 1994 and by Modrich, 1994). In 
tumors of these patients, MMR is lost , presumably through inactivation or loss of the wild- 
type allele (Leach et al., 1993; Nicolaides el a]., 1994; Papadopoulos et al., 1994; Hemminki 
et al. 1994). The mechanisms of loss of MMR, the cause of the apparent tissue specificity of 
tumorigenesis and the role of loss of MMR in the evolution of a normal cell to fully malignant 
tumors are still obscure. Moreover, recent data suggest that MMR is frequently lost in many 
types of sporadic tumors (Han et al., 1993; Parsons et al., 1993; Risinger et al., 1994; 
Chong et al., 1994; Mironov et al., 1994; Orth et al., 1994; Shibata et al., 1994; Suzuki et 
al., 1994; Umar et al., 1994b; Wada et al., 1994; Wooster et al., 1994; Liu et al., 1995). 
This supports the notion that loss of MMR may play a general role in tumor evolution. 
The best studied MMR system with respect to both genetics and biochemistry is the mutSL 
pathway of Escherichia coli (reviewed by Modrich, 1991). Central in this pathway is the 
protein encoded by the mutS gene, which binds to base mispairs and to loops of up to four 
unpaired nucleotides. After binding to the heteroduplex, the MutS-DNA complex is bound by 
the mutL gene product, triggering excision of a tract of single-stranded DNA of up to several 
kilobases that contains the mismatched nucleotide(s). The repair process is completed by 
resynthesis of the excised DNA strand and ligation of the remaining nick. MMR in £. coli is 
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responsible for maintaining genome integrity in at'least two ways: (i) by recognizing and re- 
pairing mispaired and extrahelical nucleotides that occur by, respectively, misincorporation 
and slippage during replication and (ii) by recognizing mismatches occurring in 
recombination intermediates between homologous but diverged sequences, leading to 
abortion of recombination: Consequently, coli strains deficient for either mutS or ntutL 
have a pleiotropic phenotype: an increased mutation rate, including destabilizadon of simple- 
sequence repeats (also called microsateUites; the 'Replication Error+\ or RER+ phenotype) 
and promiscuity in inter- and intragenomic recombination (the recombinator phenotype, 
Rayssiguieret al., 1989; Petit et al., 1991). 

In many respects the biochemistry of MMR systems in eukaryotes resembles that of the 
mutSL system (Holmes et aL, 1990; Varlet et al., 1990; Thomas et al., 1991; Fang and 
Modrich, 1993). Homologs of both mutS and mutL were found in eukaryotes. Based on 
mismatch binding in vitro and on the mutator and recombinator phenotypes of 
Saccharomyces cerevisiae mutants, the protein encoded by the yeast MSH2 gene (Reenan 
and Kolodner, i992a) seems to be the functional homolog of MutS (Reenan and Kolodner, 
1992b; Miret et al, 1993; Alani et al, 1995). In higher eukaryotes (Xenopus laevis, mouse' 
and human) homologs of the yeast MSH2 gene have recently been identified, based on either 
sequence homology, direct sequencing of the major mismatch-binding protein or by 
positional cloning (Fishel et al., 1993; Leach et al., 1993'; Palombo el al., 1994; Varlet et al., 

protein, when overproduced and purified from E. coli or yeast, to stretches of up to 14 
exurahelical nucleotides (so called insertion-deletion-type loops, or IDLs) in vitro (Fishel et 
al., 1994b; Alani et al.. 1995), suggestive of a role for Msh2 in the repair of these loops 
(Umar et al, 1994). Eukaryotic homologs of the mutL gene, called MLHl, PMSl and PMS2 
have been analyzed in yeast (Kramer et al., 1989a; Kramer et al., 1989b; Strand et al., 1993; 
Prolla et al., 1994a; Prolla et al., 1994b) and human cells (Li and Modrich, 1995 
The precise role of these mutS and mutL homologs in MMR and the role of MMR in the 
maintenance of genome stability in mammalian ceUs remain to be established. In addidon, the 
causative role of MMR in cellular sensitivity to simple methylating agents, as was suggested 
by the loss of MMR after selection of cells for tolerance to these agents (Branch et al., 1993; 
Kat et al., 1993) and by the methylation tolerance of a MMR-deficient tumor cell line (Koi et 
al., 1994) remains to be explored. 

To investigate the involvement of Msh2 in mammalian MMR and the roles of MMR in 
maintaining genome stability we have generated mouse Embryonic Stem (ES) cell lines 
which lack both copies of the Msh2 gene. We show that Af5A2-deficient cells lack bmding to 
some specific mismatches, hav acquired a mutator phenotype and are tolerant to simple 
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methylating agents. Moreover, these ceils have lost the barrier to recombination between 
homologous but diverged sequences. Taken together, these results suggest that MMR is 
essential for the preservation of multiple aspects of genome stability in mammalian cells. 
To study the effect of MMR deficiency on the etiology and evolution of cancer we generated 
mice lacking one or both copies of the Msh2 gene in part or all of their cells. Although 
heterozygous mice, a model for HNPCC, remain healthy to at least 8 months of age, a 
significant fraction of both chimeric and fully homozygous Msh2 knock-out mice succumbs 
to lymphomas. These results provide evidence that loss of MMR-controlled geinome stability 
can accelerate the development of cancer. 
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Results 

Targeted disruption of the Msh2 gene in mouse ES ceils 

Heterozygous ES cell lines sMsh2-42 and sMsh2'55, carrying a hygroraycin-resistance gene 
inserted between codons 588 and 589 of one allele of the M^;i2 gene, were generated by gene 
targeting using the construct shown in Fig. 1 A. Cell line dM5A2-9, carrying the hyg marker 
in both alleles of the Af5;z2 gene, was obtained by selecting cell line sMj/z2-55 for duplication 
of the chromosome carrying the targeted Msh2 allele at an elevated hygromycin B 
concentration, a method that was previously shown to be effective using the neomycin- 
resistance marker (Mortensen et al, 1992). Targeting events were detected by Southern blot 
analysis, using probes flanking both sides of the targeting construct (Fig. lA). An example 
of such a Southern blot, demonstrating the status of the MjA2 alleles in cell lines wt-2 (a 
control cell line with the targeting construct integrated by non-homologous recombination), 
sMsh2'55 and dMsh2-9 is shown in Fig. IB. Diploidy of cell lines sA/5/z2-55 and dMsh2-9 
was verified by karyotyping (not shown). 

The hyg marker was inserted, in the antisense orientation, N-terminally of the putative ATP 
binding site (Fig. lA) in a region where (in the human gene) truncations were found in many. 
HNPCC tumors (Liu et al., 1994). Moreover, no Msh2 transcript could be detected in ES 
cell line dM5;i2-9 by Northern blotting (not shown), indicating that the hyg marker strongly 

ihdica&d l)y M^2-/-r ^ ^ 

Mismatch binding in Msh2-I' cells 

Two different mismatch-binding activities have been described in mammalian cell extracts 
using gel-retardation assays: (i) An activity that recognises G.T mismatches (Jiricny et al., 
1988; Stephenson and Karran, 1989; Griffm and Karran, 1993), an cxtrahelical TG 
dinucleodde (Aquilina et al; 1994) representing an intermediate in replicadonal slippage of a 
microsatellite and, weakly, G.A, C.G, A.C and G.U mismatches (Stephenson and Karran, 
-1989; Hughes and Jiricny, 1992), (ii) In one cell line, an independent activity was detected 
that recognizes A.C mismatches and also pyrimidine-pyrimidine mismatches (Stephenson 
and Karran, 1989). Based on protein sequencing, the purified G.T-binding activity was 
shown to contain the MSH2 protein (Palombo et al., 1994). To characterize the mismatch- 
binding properties of the Msh2-f- ES cell line, we have performed gel-retardation assays 
using cell extracts of wild-type ES cell line wt-2 and i^5/z2-deficient line dMsh2-9, Extracts 
were incubated with radiolabeled 38-mer double-stranded oligonucleotides containing either 
a mismatch, an extrahelical dinucleotide, or a 14-nucleotide insertion-deletion-type loop 
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-(IDL), followed by electrophoresis on a nondenaturing polyacrylamide gel. Results of these 
experiments as shown in Fig. 2 clearly demonstrate binding activity in wild-type cell extracts 
to a G.T mismatch and to an extrahelical TG dinucleotide. Binding to both oligonucleotides 
was entirely absent in the extract of the Mshl-I- ES cell line, demonstrating the involvement 
of the Msh2 protein in this binding activity. As shown before (Hughes and Jiricny, 1992), 
binding was abolished by the inclusion of ATP in the binding reaction (not shown). We were 
unable to detect specific binding to G. A, G^G, or A.C mismatches under the conditions used 
in wild-type or mutant cell extracts. In addition, no Msh2-dependent binding to a 14- 
nucleotide lDL was observed (Fig. 2). 

Mshl'h cells have a mutator phenotype 

The Mshl'l' cell line was indistinguishable from the wild-type cell line wt-2 and the parental 
heterozygous cell line sMsh2'55 with respect to growth and plating efficiency (not shown). 
To address the role of Msh2 in MMR, we determined the stability of two microsatellite 
markers in Mshl-f- cell line 6Msh2-9 and A/5/z2+/-f line wt-2. For this purpose, 24 
subclones were derived of each cell line, which had undergone approximately 20 cell' 
divisions since its generation. Chromosomal DNA, isolated from each expanded subclone 
was subjected to the polymerase chain reaction (PCR) using two end-labelled primer pairs 
{D7Mitl7 and D14Mitl5; Dietrich et al., 1994). Whereas none of the wt-2 sublines showed 
any alteration in microsatellite length, clear length alterations in many 6Msh2-9 sublines were 
seen for both microsatellites (8 out of 24 for marker D7Mitl7 and 6 out of 24 for marker 
DMMitlS, Fig.3). This observation strongly suggests that the Msh2 protein is involved in 
the repair of slipped replication intermediates. The microsatellite-slippage rate was estimated 
to be 10"2 to 10-3 per generation. 

We subsequentiy investigated the effect of loss of the Msh2 gene on the mutation frequency 
of a functional gene. To this purpose, 6x10^ cells of cell lines wt-2 and dMsh2-9 were plated 
in the presence of 6-thioguanine (6-TG) to select for cells that have lost activity of the X- 
linked Hprt gene by mutation. Whereas no 6-TG-resistant colonies were seen in the wt-2 
culture, 188 resistant colonies were present in the dMsh2'9 culture. This result extends the 
mutator phenotype of Msh2~l' cells to functional genes. 

Msh2'/' cells are tolerant to N-methyi-N'-nitro-N-nitrosoguanidine (MNNG) 
MMR was suggested to mediate hypersensitivity to agents that methylate G residues at the 
position. It is believed that MMR recognises the O^-meG.T mispair that occurs after 
erroneous incorporation of a thymidine nucleotide opposite to O^-meG during replication 
(Griffm et al., 1994). This is supposedly followed by excision and repair synthesis, again 
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-incorporating thymidine opposite to O^-meG, triggering a new round of MMR. The net result 
of this cycling between excision and resynthesis will be the presence of single-stranded 
regions at the site of 06-meG residues which will lead to double-stranded gaps when 
replicated during S phase, resulting in cell. death (for reviews., see Karran and Bignami; 
1992; Karran and bignami, 1994). It is predicted that loss of MMR will prevent this process, 
thus conferring tolerance to simple methylating agents. To test this hypothesis direcUy' 
survival of Msh2./- cell line dM^/«2-9.,A/^/,+/- cen line sA/^A2-21 and wild-type cell line Wt-2 
was determined after exposure to a range of concentrations of N-methyl-N'-hitro-N- 
nitrosoguanidine (MNNG). In the growth medium. O^-benzylguanine was included to 
competitively inhibit endogenous methyltransferase activity that might odierwise remove the 
methyl groups added by MNNG (Dolan et al., 1990). Fig. 4 shows that the LD50 for, 
MNNG was increased 20-fold in the Mshl-I- cell line compared to the heterozygous and 
wild-type cell lines, directly proving the involvement of Msh2 in determining cellular 
sensitivity to MNNG. Heterozygous cell line sM5A2-21 displayed no increased tolerance to 
MNNG. 

Loss of suppression of recombination between diverged sequences in 
Msh2-/- cells . 

In E coli (Shen and Huang. 1986). Drosophila (Nassif and Engels. 1993) and mammalian 



between homologous DNA stretches is highly dependent on their sequence identity. We have 
previously demonstrated that in a gene targeting assay, homologous recombination at the 
Retinoblastoma (Rb) locus in ES cells, derived from mouse strain 129. was 50-fold more 
efficient with a 129-derived targeting construct than with a construct derived from a non- 
isogenic (BALB/c) mouse strain (Te Riele et al. 1992; fable 1). This construct contained 
0.6% base sequence divergence with respect to the isogenic 129 construct. To provide 
additional evidence -that suppression of recombination was solely dependent on the 
polymorphisms between the endogenous locus and die targeting DNA, we performed the 
inverse experiment, i.e. targeting of a BALB/c-derived ES cell line with the 129- and 
BALB/c-derived constructs. This experiment yielded the inverse result, i.e. a higher targeting 
efficiency with the BALB/c targeting construct tiian with the non-isogenic 129 targeting 
construct (Table 1). To investigate whether MMR is responsible for this antirecombmation 
effect, we have repeated the Rb targeting experiment in die (129-derived) ES cell line dMjA2- 
9. We found that in the Msh2-/- cell line homologous recombination at the Rb locus with the 
non-isogenic construct was as efficient as with the isogenic construct (Table 1). This 
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experiment demonstrates that Msh2 is involved in preventing homologous recombination 
between diverged DNA sequences. 

Msh2-deficient mice are viable but predisposed to tumorigenesis 

( To study the effect of MMR deficiency on the development of cancer, Mshl-^l- ES cell lines 
sMsh2'55 and sMsh2-42 and Msh2-i- ES cell line dMsh2'9 were used to generate chimeric 
mice. Both the Msh2 single and double knock-out ES cells gave rise to healthy animals >yith 
coat-color chimerism of 20 to 70 %. This result suggests that MMR deficiency did not alter 
the capacity of the ES cells to compete with wild-type ceUs in chimeric development. 
A/5/i2+A chimeras were found to transmit the mutant allele through the germ line resulting in 
Msh2 heterozygous Fl mice. These mice, which may serve as a model for HNPCC, were 
healthy up to at least 8 months of age (Table 2). 

Intercrossing of MjA2+/- mice resulted in Mshl-I- mice according to a normal Mendelian 
distribution; they were healthy at birth and fertile. However, 30% of the A/j/i2-/- mice 
developed metastasizing lymphomas of T-cell origin with a peak at 2 months of age (Table 
2). In addition, one mouse suffered from generalized histiocytic sarcoma at 3.5 months of' 
age. Extensive histological analyses of these mice did not reveal any other abnormality. Also 
healthy Msh2-I- mice sacrificed at 2 months of age were histologically normal and had 
nonn£il B- and T-cell populations as evidenced by FACS analysis. 
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Discussion 

Lack of mismatch binding in M5A2-/. cell extracts 

Whereas wild-type cell extracts display efficient binding to both a G.T mismatch as weU as to 
an extrahelical TG dinucleotide. extracts of the ES cell line dMsh2-9 lack this binding (Fig 
2). suggesting that the Msh2 protein is essential for binding to these heteroduplexes. We 
were unable to detect specific binding activity to other mismatches under the conditions used 
This result is compatible with binding studies by others which demonstrate that binding to a 
G T mismatch is stronger than to other mismatches (Stephenson and Karran, 1989; Hughes 
and Jiricny. 1992) whereas the independent A.C binding activity is apparently cell-type 
specific (Jiricny et al.. 1988; Stephenson and Kairan, 1989); The methylaUori-tolerant cell 
lines 'clone B' and RajiF12 were also found to lack binding to both a G.T mismatch (Branch 
et al., 1993; Griffin et al., 1994) and to an extrahelical TG dinucleotide (Aquilina et al., 
1994), Binding to an A.C mismatch was unaffected (Branch et al., 1993). Therefore, these 
cell lines may have lost the Msh2 gene. Purified human MSH2, overproduced in E. coli, and 
purified yeast MSH2 were shown to bind to G.T mismatches (Fishel et al., 1994a; Alani et' 
al., 1995). but also strongly to IDLs of up to 14 nucleotides (Fishel et al., 1994b; Alani et 
al., 1995). Surprizingly, we could not detect Msh2-dependent binding to a 14 nucleotide IDL 
in ES cell extracts. We are unable to explain this apparent discrepancy. In contrast to tiie 

containing G.T mismatch-binding protein fraction purified from HeLa cells (Hughes and 
Jiricny, 1992; Palombo et al., 1994), the binding capacity of the purified MSH2 proteins was 
not abolished by ATP (Fishel et al., 1994b; Alani et al., 1995). This suggests that the 
punfied MSH2 proteins behave differently from the namraily produced protein. The finding 
that in Msh2-/. cells length shifts in microsatellites were never larger than 4 nucleotides 
(representing two CA dinucleotides; Fig. 3). suggests that binding activity of Msh2 to 
extrahelical loops longer than 4 nucleotides is not required to maintain microsatellite stability. 
We are currently initiating /« vitro repair assays using wild-type and Msh2-/- cell extracts to 
further elucidate this point. 

Mutator phenotype of Af5/t2-dencient cells 

Cell line dMsh2.9 had acquired microsatellite instabiUty (tiie RER+ phenotype), indicating 
failure to repair slipped replication intermediates. The frequency of microsatellite instability 
was similar to tiiat of some RER+ tumor cell lines (Parsons et al.. 1993; Bhattacharyya et al., 
1994). The mutator phenotype of these cells also affects functional genes as evidenced by the 
rate of Hprt deficiency in a dMsh2-9 culture which was increased from undetectable (i.e. 
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lower than 1.6x10"'^) for wild-type cells to 3x10-^. This frequency is in agreement with that 
found in some RER"*" tumor cell lines (Bhattacharyya et al., 1994; Shibata et al,, 1994; 
Eshleman et al., 1995); although cell lines that have been selected for MMR deficiency using 
a methylating agent tend to have somewhat lower mutation frequencies (Goldmacher et al., 
1986; Branch et al., 1993; Kat et ai., 1993; Aquilina et al., 1994). Growih.rate and plating 
efficiency oif M5/i2-deficient, heterozygous and wild-type cells were indistinguishable, 
suggesting that the mutator phenotype does not interfere with viability of the cells. This is 
remarkable, since both were seriously affected in some cell lines that were selected for MMR 
deficiency using methylating agents (Goldmacher et al., 1986). This suggests that in, the latter 
cells, additional events may have taken place that interfere Avith cell viability. Based on the ; 
mutator phenotype of dM^A2-9 cells and on the binding studies described above. :,we 
conclude that MshiZ is involved in; and essential for, postreplicational MMR. - 

Sensitivity to simple methylating agents is mediated by MMR 

Sensitivity of cells to agents that mcthylate DNA at the position of G residues has been, 
attributed to futile attempts of MMR to correct the resulting O^-mcG.T, basepairs (see 
results). This hypothesis is supported by two observations: (i) Prolonged selection of cells 
for tolerance to Methylniurosourea or MNNG frequently results in loss of MMR (Kat et al,, 

1993) and loss of G.T-binding activity (Branch et al., 1993; Griffin et aJ., 1994). (ii) An 
RER+ tumor cell line was found to be tolerant to these agents and sensitivity was restored by ; 
introduction of the chromosome containing a wild-type allele of the defective mismatch-repair 
gene (Koi et al., 1994). Our finding; that Mj/i2-deficient cell line dM5ft2-9 had acquired , 
tolerance to MNNG provides direct evidence in support of this hypothesis. This finding may 
have important clinical implications. The triazene class of carcinostatic drugs probably 
derives its cytotoxic effect from methylaiion at the position of G residues (Michejda et al., 

1994) . Whereas MMR-deficient tumors are supposedly tolerant to these drugs, they may 
elicit severe cytotoxity in normal tissues of the patient. In addition, the persistence of O^- 
meG.T mispairs in MMR-deficient tumor cells might lead to increased G to A transitions 
(Aquilina et al., 1993), resulting in acceleration of tumor progression. Finally, the finding 
that simple methylating agents select, or induce, MMR deficiency in cells has led to the 
hypothesis that this process is involved in determining the tisssue specificity of tumorigenesis 
in HNPCC patients, due to the presence of such agents in the intestine (Karran and Bignami, 
1994; Varlet etal.^ 1994). 
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MMR edits homologous recombination in 'mammalian cells 
Previously, it has been shown that the presence of less than I % nucleotide difference can 
strongly suppress hoinologous recombinaUon in mammalian cells (Waldman and Liskay. 
1988, Te Riele et al., 1992). Here, we have extended these results and we have demonstrated 
that Msh2-/^ cells have lost this heterology-dependent suppression of recombination. Based 
on results obtained in E. coli (Worth et al.. 1994). this function is probably mediated by the 
Msh2 protein, by recognizing mismatches that occur during heteroduplex formation between 
two not perfectly complementary DNA strands. This may directly destabilize the 
heteroduplex or. altemaUvely. prevent elongation of strand exchange. Similar results have 
recently been obtained with M5//2-deficient yeast strains (Alani et al.. 1994; Selva et al - 
1995). suggesting that this role of MMR is strongly preserved during evolution. In E. coli 
loss of MMR has been shown to result in a high frequency of inter- and intrachromosomal 
rearrangements due to ectopic recombination between diverged sequences (Rayssiguier et al 
1989; Petit et al.. 1991). More tiian the E. coli genome, mammalian genomes contain many 
repeated but diverged sequences. Examples of these are gene families. SINE sequences. 
LINE sequences and pseudogenes (Brosius and Gould, 1992; Nowak, 1994). Ectopic' 
recombination between these sequences would lead to genomic rearrangements that may 
either be detrimental for cell viability due to loss of essential genomic information or. 
alternatively, lead to activation of oncogenes or inactivation of- mmor suppressor genes 

be allowed to proceed, however, to allow recombinational repair of double su-and breaks 
using the (identical) sister chromatid as sequence donor. For this reason, mechanisms 
counteracting homologous recombination when both recombining stretches comain 
divergencies, while permitting recombination to proceed when both stretches are identical 
may be of great importance for die mammaHan cell to enable repair witiiout risking genome 
rcarrangment. Based on the effect of MjA2-deficiency on gene targeting, it is likely that 
MMR plays this scrutinizing role. 

Our finding that Msh2 acts not only anti-mutagenic but also anti-recombinogenic. suggests 
that MMR deficiency may conUibute to tumorigenesis by elevating the rate of both point 
mutations and chromosomal rearrangements. 

Role of AfsA2 deficiency in tumorigenesis 

A/^A2-deficiency did not interfere with mouse embryonic developmem and is compatible witii 
adult life. This observation substantiates the recent finding that some HNPCC patients seem 
largely MMR deficient in normal cells (Parsons et al.. 1995). A significant fraction of 
chimeric and pure Msh2./. mice developed metastasizing lymphomas. It is likely that die 
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-developing immune system in newborn mice, in which a high turnover of maturing T cells 
exists, offers a window where MMR deficiency can strongly accelerate the accumulation of 
transforming events. 

The basal level of mutagenesis in the cell, estimated to be approximately 10-10 per nucleotide 
per generation, is considered to be insufficient to sustain the multistep clonal outgrowth of 
cells in tumorigenesis (Loeb, 1994). Therefore an increased mutation rate, such as caused by 
MMR deficiency, is likely to be obligatory to accelerate this process (Loeb, 1994). Our 
results suggest that a rapid cell expansion is required to unveil the effect of MMR deficiency 
on tumor development. Corroborating this notion, recent studies of microsatellite instability 
during cancer development show that loss of MMR probably does not occur as the initial 
step, but rather within an early tumor stage'such as an adenoma (Young et al., 1993; 
Aaltonen et al., 1994; Chong et al., 1994; Shibata et al., 1994; Wada et al., 1994), which, in 
HNPCC patients, have been found to evolve rapidly into carcinomas (Lynch et al., 1993; 
Jass et al., 1994). Moreover, two recent studies support the notion that enhanced 
mutagenesis due to MMR deficiency is responsible for tumor progression: (i): Some RER+ 
tumors had acquired frameshift mutations in the P2-Microglobulin gene within, a CT repeat,' 
indicative of unrepaired replication slippage, and putatively leading to escape from immune 
surveillance (Bicknell et al., 1994); (ii): in some RER+ colon carcinomas from HNPCC 
patients, p53 and APC were found to contain either frameshifts in repeated nucleotides or G 
to A transitions, indicative of the failure to repair slipped simple sequence repeats and G.T 
mismatches, respectively (Lazar et al., 1994). 

In conclusion, our studies on Msh2 deficiency prove the involvement of Msh2 in MMR and 
show that MMR functions as a mechanism safeguarding the mammalian genome against 
mutation by misincorporation and recombination between diverged sequences. In addition, 
our results lend support to the hypothesis that MMR deficiency can strongly accelerate 
malignant transformation of rapidly expanding cell populations. 
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-Experimental procedures 
Disruption of Msh2 in mouse ES cells 

Genomic Msh2 fragments were obtained by screening a 129pLA-derived genomic DNA 
library with a murine Mshl cDNA probe (Varlet et al., -1994). The targeting construct; was 
prepared by subcloning a 12.5 kbp 5amH I fragment and inserting a hygromycin resistance 
gene (from PGKAy^, te Riele et al., 1990) into the unique I site located within an exon 
sequence of Msh2, Cloning procedures were performed according to Sambropk et al. 
(1989). 

The targeting construct was seperated from vector sequences by gel electrophoresis, purified 
by electro elution and introduced into 1290LAHlerived ES cell line E 14 by electroporation as 
described (te Riele et al., 1992). Electroporated cells were seeded onto gelatin-coated lO-cm 
dishes (10^ cells per plate) and subjected to hygromycin B selection (150 |agper ml) in BRL- 
conditioned medium (Hooper et al., 1987) the following day. After 10 days, individual 
hygromycin B -resistant colonies were randomly picked and expanded on mouse embryonic 
fibroblasts feeder layers. DNA was extracted from expanded colonies, digested with EcoK I 
and analyzed by Southern hybridization using probes flanking both sides of the targeting 
construct (Fig. lA). Two cell lines were obtained out of 135 hygromycin B-resistant 
colonies showing bands diagnostic for correct integration of the /ly^marker between codons 
^5^nd^|£^%ne^cji^^ 

sMsh2-55, One cell line obtained from this experiment, designated wt-2, carrying a randomly 
integrated hyg gene, was used as an Mshl-^/-^ control. 

To obtain an ES cell line carrying a disruption in both copies of Msh2, lO^ cells of cell line 
sAfj/i2-55 were plated onto 10-cm plates and cultured in BRL-conditioned medium 
containing 1.0 or 1.5 mg per ml of hygromycin B. After 12 days of of culturing in selective 
medium and 7 days in non-selective medium, 24 colonies were obtained which had survived 
1.5 mg per ml of hygromycin B. DNA was extracted from expanded colonies, digested ^yith 
EcoR I and analyzed by Southern hybridization. One cell line, designated dA/j/i2-9, carried 
two disrupted Msh2 alleles, had lost the wild-type copy and contained the normal number of 
chromosomes. ES cell lines which had survived 1.0 mg per ml of hygromycin B still 
contained one disrupted and one wild-type AfjA2 copy. One of these Msh2^f- liiies was 
designated sAf5/i2-21 and used for several experiments. 

Generation of Msh2 mutant mice 

Chimeric mice were obtained by injecting 10-15 cells of ES cell lines sMsh2-55, sMsh2-42 
and dMsh2-9 into C57B1/6 blastocysts. Male chimeras obtained with Msh2'^^- ES cells were 
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crossed with wild-type 1290LA and FVB mice and found to transmit the mutated Msh2 allele 
through the germ line. Homozygous Msh2 mutant mice were obtained by intercrossing Fl 
heterozygotes. 

Gel shift assay 

Preparation of cell extracts, annealing of oligonucleotides, binding of cell extracts to duplex 
oligonucleotides containing mismatched or extrahelical nucleotides, and nondenaturing poly- 
acrylamide gelelectrophoresis were performed essentially as described (Stephenson and 
Karran, 1989). However, gelelectrophoresis was performed in TAE buffer rather than in 
TBE buffer. To obtain duplex oligonucleotides, the oligonucleotide U: 5'-GGGAAGCT- 
GCCAGGCCCCAGTGTCAGCCTCCTATGCTC-3' (sequences were derived from 
Aquilina et al., 1994) was radiplabelled and annealed with any of the following unlabelled 
oligonucleotides: L-G.T: S^-GAGCATAGGAGGCTGACArTGGGGCCtGGCAGCrTT- 
CCC-3' (resulting in a G.T mismatch); L-G.A: 5*-GAGCATAGGAGGCTGACAATGG- 
GGCCTGGCAGCTTCCC-3* (resulting in a G.A mismatch); L-G.G: 5 -GAGCATAGG- 
AGGGTGACAGTGGGGCCTGGCAGCTTCCC-3' (resulting in a G.G mismatch);' 
L-A.C: 5'-GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC.3\ (resulting 
in an A.C mismatch); L-TG: 5*-GAGCATAGGAGGCTGACACTGTGGGGCCTGGCA- 
GCTTCCC-3 ■ (resulting in an extrahelical TG dinucleotide); L-HOM: 5 -GAGCATAGGA- 
GGCTGACACTGGGGCCTGGCAGCTTCCC-3' (resulting in a homoduplex); 
L.LOOP14: 5*-GAGCATAGGAGGCTGACACAT ACGTGAGTACTCTGGGGCCTGG- 
CAGCTTCCC-3' (resulting in an IDL loop of 14 extrahelical nucleotides). In all assays, a 
twofold excess of unlabelled homoduplex competitor oligonucleotide was included. As a 
positive control, a duplex oligonucleotide containing the binding site for the E2F family of 
transcription factors was used (Beijersbergen et al., 1995). 

PGR amplification of microsatellites 

Subclones of ES cell lines dMsh2-9 and wt-2 were generated by seeding cells onto mouse 
embryonic fibroblasts feeder layers at a density of ICP cells per 10 cm^. At that time, the ES 
cell lines were in culture for approximately 20 divisions since their generation. Twenty-four 
colonies from each cell line were expanded. Chromosomal DNA was isolated and subjected 
to the polymerase chain reaction using two end-labelled primer pairs {D14Mitl5 and 
D7Mitl7, Dietrich et al., 1994). Amplified products were electrophoresed on a denaturing 
polyacrylamide gel. 
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.Mutation frequency 

6x106 cells of ES cell lines dmsh2-9 and wt-2 were plated onto 486 cm2 gelatin-coated 
tissue culture surface in BRL-conditioned medium; After two days, 6-Thioguanine was 
added at a concentration of 1 0 per ml. After two weeks the number of resistant colonies 
was counted. • 

Sensitivity to MNNG 

ES cell lines dMsh2-9, sMsh2-2 1 and wt-2 were seeded onto MEF feeder layers at a density 
of 103 ceils per 4 cm2- The following day, cells were exposed for one hour to MNNG 
ranging in concentration from zero to 36.45 in serum-free medium. After 4 days of 

incubation, cells were trypsinized, stained with Trypan blue and counted. During the whole 
procedure frpni one hour before exposure to MNNG. O^.tenzylguanine (20 |iM. kindly 
provided by A. Pegg) was included in the medium in order to competively inhibit 
endogenous methylttarisferase activity that might otherwise remove the methyl groups added 
by MNNG (Dolan et al, 1990). 

Rb targeting with isogenic and non-isogenic DNA 

Targeting and subsequent analysis of the /J* locus in a BALB/c-derived ES cell line (kindly 
provided by S; Rastan) arid 129/GLA-derived ES cell lines dMsh2-9 were performed 

from the 1296la genome) and B/cRb-neo (derived from the BALB/c genome) carry an 
insertion of the pMC Ineo marker in exon 19 of the Rb gene and differ approximately 0.6% at 
the nucleotide level (te Riele et al, 1992). 
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Legends 

Fig. 1. Targeted disruption of the mouse Af5/i2 gene. 

A: Top: schematic diagram of the mouse Msh2 protein. ATP binding and HTH: putative 
ATPase and hehx turn helix domains, respectively (Varlet et al, 1994). The position at which 
the protein is disrupted by insertion of the Hyg marker is indicated. Bottom: the genomic 
Msh2 locus; indicated below is the targeting constract carrying the Hyg marker inserted into a 
unique 5naB I site within an exon sequence of Msh2. The positions of the external probes 
used for detection of targeting events are indicated. 

B: The Msh2 locus in Mshl+Z-^ cell line wi-2, M5/i2+/- cell line sMshl-SS arid Msh2-/- cell . 
line dMsh2-9, Genomic DNA of these cell lines was digested with EcoRI and analyzed by 
Southern hybridization using probe 1 (Fig. lA). Arrowheads indicate the positions of the 
wild type (wt, 20 kbp), and disrupted (ko, 7.5 kbp) alleles. 

Fig. 2. Mismatch-binding activity in wild-type and Af5/i2-deficient cells. 

Binding activity in extracts of Af j/i2+/+ cell line wt-2 and Msh2-/- cell line 6msh2-9 to 

double-stranded oligonucleotides containing a mismatch (G.T, G.A, G.G, or A.C), an 

extrahelical dinucleotide (Extrahelical TG), or an extrahelical 14-nucleotide loop (LOOP . . 

14) and a Homoduplex was assessed using a gel retardation assay. Sequences of the 

oligonucleotides are given under Experimental Procedures. Arrowheads indicate the positions 

of mismatch-specific complexes. E2F: Binding to an oligonucleotide carrying an E2F site 

served as a positive control (Beijersbergen et al, 1995). v v- ^ 

Fig. 3. Microsatellite stability in wild-type and Msh2 -deficient cells. 

Genomic DNA of 24 subclones derived from Msh2'^/+ cell line wt-2 and Msh2-f- cell line 
dmsh2-9 was amplified by PCR using primer sets D7Mitl7 and DMMitlS. Arrowheads 
indicate microsatellite alleles with a detectable length alteration. 

Fig. 4. Tolerance of M5/i2-deficient ceDs to the simple methylating agent MNNG. 
ES cell lines wt-2 (M5/12+/+), sMsh2'55 {Msh2'^/') and dMsh2'9 (Msh2-/-) were exposed to 
increasing amounts of MNNG for one hour in the presence of O^-benzylguanine. After four 
days of incubation, cells were trypsinized and counted. 
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Table 1. Homologous recombination has lost dependence on sequence identity in Msh2 
deficient ES cells. 



ES cells 



Homologous recombination 
vs total no. 0418^ col. with 



Isogenic vs 
non-isogenic 



^29 Rb-neo 



B/cRb-neo 



BALB/c {Msh2'^/'^) 



1/68 (1.5%) 



16/72 (22%) 



15x 



1290LA (Msh2'¥/'¥) 



33/94 (35%) 



1/144 (0.7%) 



50 X 



^ 290LA {Msh2'/') 



42/185 (23%) 



47/184 (26%) 



0.9 X 



strains of mice, respectively, with 0.6% sequence divergency. Targeting frequencies in 
1290LA (A/5/i2+/+) ES cells are derived from te Riele et al., 1992. 
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_Table 2. Phenotypes of Msh2 mutant mice. 



Genotype 



Age 

(months) 



Condition 
at birth 



Tumor incidence 



type 



frequency** age 

(months) 



Msh2-¥l- 



8 



healthy 



0/110 



Msh2+/r.:Msh2'/'* 



healthy 



Lymphoma 1/24 



Msh2-/- 



4-5 healthy Lymphoma 6/19 

Histiocytic 1/19 
sarcoma 



2-4 
3.5 



*Chimeras consisting of wild-type and Mshl-J- cells 

**Number of mice carrying a tumor per total number in the experiment 
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-GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 



CLAIMS , 

1. A method for modifying the genome of a eukaryotic cell by homologous recombination 
between a specific locus in the gertome and a DNA sequence introduced into a cell in which 
the mismatch repair system has been genetically and/or transitorily inactivated and said 
sequence having up to 30% base sequence differences with respect to the target locus in 
the region where recombination can take place or saicf sequence differs from ttie target locus 
to such a level that homologous recombination is nomnaiiy strongly impeded. 

2. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
disnjption of both copies of one of the genes essential for DNA mismatch repair. 

3. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
introduction of antisense RNA constructs, driven by an appropriate promoter, thus 
functionally inactivating one of the genes essential for DNA mismatch repair. 



4. A method according to claim 1 wherein the cell's rhismatch repair system is inactivated by 
introduction of modified antisense oligodeoxynucleotdes, thus functionally inactivating one 

the genes essential for DNA mismatch repair. 

5. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
expression of a dominant negatlvily acting version of a gene involved in DNA mismatch 
repair. 

6. A method according to claim 1 wherein the cell's mismatch repair system is inactivated 
transitorily by saturation of the mismatch repair system by the introduction into the cell of 
DNA molecules earring one or more mis- or unpaired bases. 

7. A method according to claims 1 -6 wherein the target cell is an embryonic stem cell or a 
related cell type derived from any mammalian species, to be used to generate chimeric 
animals and chimeric animals that can transmit the mutated genome through the genii line. 

8. A method according to claims 1-6 wherein the target cell is a cell lin derived from any 
mammalian species and cultured in vitro. 
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- 9. A method according to claims 1-6 wherein the target cell is derived from any organ of any 
mammalian species. 

10. A method according to claim 9 wherein the modified target cell can be reintroduced into 
the organism. , 

1 1 . A method according to claims 1 -6 wherein the target ceil is a plant cell. 

1 2. A method according to claims 1-6 wherein the target cell is derived from outbred 
organisms implicating that they contain nbnidentical copies of homologous chrorriosomes. 

1 3. A method according to claim 1 wherein the targeting DNA at the region where 
homologous recombination can take place is derived from the same species as the target 
cell. 

14. A method according to claim 13 wherein the targeting DNA is modified in vitro at. specific 
sites thus deliberately introducing sequence divergence. 

1 5. A method according to claim 1 wherein the targeting DNA at the region where 

homologous recombination can take place is derived from the same species as the target cell * ? 

but is nonisogenic with the DNA of the target cell, the two sequences differing up to 5% at 

the nucleotide level. f 

16. A method according to claim 1 wherein the targeting DNA at the region where 
homologous recombination can take place is derived from another species as the target cell, 

the targeting sequence and the target locus comprising up to 30% sequence divergence in . 
the region where homologous recombination can t£ike place. 

17. A method according to claim 16 wherein the targeting DNA is a chromosomal DNA 
fragment carried on a YAC or cosmid vector. 

1 8. A method according to claim 1 wherein the targeting DNA is an double- or single- 
stranded oligonucleotide consisting of 10-100 bases or basepairs of which one or several 
bases or basepairs differ from the target locus. 

19. A method according to claim 1 wherein th targeting DNA is constructed in such a way to 
allow the generation of deletions of chromosomal regions as follows: 

- The targeting construct consists of any selectable marker gene flanked by two sequences 
which can recombine with chromosomal loci: one flanking sequence being identical or highly 
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. homologous (>95''/o sequence identity) to a sequence of the genome of the target cell, the 
other flanking sequence being a so caJled repetatlve sequence (e.g. Alu. LINE, SINE) of 
which numerous diverged copies are spread over the genome. 

- The targeting construct consists of any selectable marker gene flanked by two sequences 

one being a sequence that can act as a telomere, the other.flanking sequence being a so ' ' 
called repetative sequence (e.g. Alu. LINE. SINE) of which numerous diverged copies are 
spread over the genome. 

20. A method to test inactivation of the cellular mismatch repair system related to claims 
2.3.4.5.6 using a gene targeting assay involving comparison of the targeting efficiency of 

isogenic versus nonisogenic DNA targeting constructs. 

21. A method to test inactivation of the cellular mismatch repair system related to claims 
2.3.4.5.6 using an assay involving intrachromosomal recombination between homologous 
but diverged DNA sequences. 

22. A method to test homologous recombination related to claims 13.14,15.16.17 1819 using 

an Embryonic Stem cellline or any other cell line in which the mismatch repair system is 
inactived by disruption of both copies of the gene or another mismatch repair gene. 

MMMIiiil^^ 

23. A method related to claims 8,9,10 in order to derive cells or cell lines from mismatch repair 
defeaent mice carrying a disruption in both copies of the /Ws/z2gene or another mismatch 
repair gene. 
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